Meta-knowledge Management in Multistrategy Process-oriented Knowledge Discovery Systems
نویسندگان
چکیده
Current electronic data repositories are growing quickly and contain big amount of data from commercial, scientific, and other domain areas. The capabilities for collecting and storing all kinds of data exceed the abilities to analyze, summarize, and extract knowledge from this data. Knowledge discovery systems (KDSs) use achievements from many technical areas, including databases, Data Mining (DM), statistics, AI, machine learning, pattern recognition, high performance computing, management information systems (MIS), decision support systems, and knowledge-based systems. Knowledge discovery is an innovative approach to information management and is associated commonly with the nontrivial process of identifying valid, novel, potentially useful, and ultimately understandable patterns and relations in large databases (Fayyad, 1996). Numerous data mining techniques have recently been developed to extract knowledge from large databases. Since present-day KDSs are armed with a number of available techniques to process data; and, potentially, there are many possible combinations of these techniques to construct a DM strategy for mining a current problem. In a real problem-solving situation it is not computationally feasible to apply every DM strategy. Therefore, dynamic selection of data mining methods in knowledge discovery systems has been under active study (see, for example, (Tsymbal, 2002)). However, at least two contexts of dynamic selection can be distinguished. First, the so-called multi-classifier systems that apply different ensemble techniques (Dietterich, 1997). Their general idea is usually to select one classifier on the dynamic basis taking into account the local performance (e.g. generalisation accuracy) in the instance space. Second, multistrategy learning that applies a strategy selection approach which takes into account the classification problemrelated characteristics (meta-data). We are interested in the second context in this study. Selection of the most appropriate DM technique or a group of the most appropriate techniques is usually not straightforward. Many empirical studies are aimed
منابع مشابه
The Relationship between Knowledge Management and the Process of Entrepreneurship in Sport Organizations
In the current competitive world, organizations can reach competitive advantage which support entrepreneurship by providing the required tools. One of the most important tools for developing entrepreneurship which was neglected in previous studies is organizational knowledge management. The present paper aims to shed light on the role and importance of knowledge management in sport entreprene...
متن کاملMultistrategy Data Exploration Using the INLEN System: Recent Advances
Recent advances in the development of the INLEN system for multistrategy data exploration are briefly reviewed. These advances include the development of a meta-level language for data mining and knowledge discovery, called knowledge generation language (KGL), and the employment of a new type of attributes, called structured attributes. These features are illustrated by an example concerned wit...
متن کاملA Methodology and Life Cycle Model for Data Mining and Knowledge Discovery in Precision Agriculture
This paper presents a methodology for data mining and knowledge discovery in large, distributed and heterogeneous databases. In order to obtain potentially interesting patterns, relationships, and rules from such large and heterogeneous data collections, it is essential that a methodology be developed to take advantage of the suite of existing methods and tools available for data mining and kno...
متن کاملA Multistrategy Learning Approach to Flexible Knowledge Organization and Discovery
1 Also with Lockheed Martin Federal Systems, Gaithersburg, MD. 2 Also with Science Applications International Corp., Tysons Corner, VA. Abstract Properly organizing knowledge so that it can be managed often requires the acquisition of patterns and relations from large, distributed, heterogeneous databases. The employment of an intelligent and automated KDD (Knowledge Discovery in Databases) pro...
متن کاملAqBC: A Multistrategy Approach for Constructive Induction
In order to obtain potentially interesting patterns and relations from large, distributed, heterogeneous databases, it is essential to employ an intelligent and automated KDD (Knowledge Discovery in Databases) process. One of the most important methodologies is an integration of diverse learning strategies that cooperatively performs a variety of techniques and achieves high quality knowledge. ...
متن کامل